
(i) APPLICANT:

Seidel, Christoph; Weinhues, Ursula-Henrike;
Schmitt, Urban; Motz, Manfred; Wiedmann, Michael;
Upmeier, Barbara; Soutscheck, Erwin



(ii) TITLE OF INVENTION: 

Recombinant antigen from the NS3 region of the hepatitis C virus 

(iii) NUMBER OF SEQUENCES: 9 



(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Felfe & Lynch

(B) STREET: 805 Third Avenue

(C) CITY: New York

(D) STATE: New York

(E) COUNTRY: USA

(F) ZIP: 10022



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 inch 1.44 Mb storage diskette 

(B) COMPUTER: IBM PS/2 

(C) OPERATING SYSTEM: PC-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPA)



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/511,759 

(B) FILING DATE: 7 AUGUST 1995

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: P 44 28 705.4

(B) FILING DATE: 12 AUGUST 1994 

(viii) ATTORNEY/AGENT INFORMATION 

(A) NAME: HANSON, NORMAN 

(B) REGISTRATION NUMBER: 30,946

(C) REFERENCE/DOCKET NUMBER: HUBR 1067.1

(ix) TELECOMMUNICATION INFORMATION 

(A) TELEPHONE: 212-688-9200 

(B) TELEFAX: 212-838-3884 



(2) INFORMATION FOR SEQ ID NO: 1:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 885 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) INITIAL ORIGIN: 

(A) ORGANISM: hepatitis C virus

(viii) POSITION IN GENOME: 

(A) CHROMOSOME SEGMENT: NS3 

(ix) CHARACTERISTICS: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..885 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ATG ACC ATG ATT ACG AAT TCC CGG GGA TCC ATC ATG AAA TCC CCG GTG
Met Thr Met Ile Thr Asn Ser Arg Gly Ser Ile Met Lys Ser Pro Val
1 5 10 15

TTC ACG GAT AAC TCC TCT CCA CCG GTA GTG CCC CAG AGC TTC CAG GTG
Phe Thr Asp Asn Ser Ser Pro Pro Val Val Pro Gln Ser Phe Gln Val
20 25 30

GCT CAC CTG CAT GCT CCC ACA GGC AGC GGC AAG AGC ACC AAG GTC CCG
Ala His Leu His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro
35 40 45

GCT GCA TAC GCA GCT CAG GGC TAC AAG GTG CTA GTG CTC AAC CCT TCT
Ala Ala Tyr Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro Ser
50 55 60

GTT GCT GCA ACA TTG GGC TTT GGT GCC TAC ATG TCC AAG GCT CAT GGG
Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His Gly
65 70 75 80

ATC GAT CCT AAC ATC AGG ACC GGG GTG AGA ACA ATT ACC ACT GGC AGC
Ile Asp Pro Asn Ile Arg Thr Gly Val Arg Thr Ile Thr Thr Gly Ser
85 90 95

CCC ATT ACG TAC TCC ACT TAC GGC AAG TTT CTT GCC GAC GGC GGG TGC
Pro Ile Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys
100 105 110

GCA GGG GGT GCT TAT GAC ATA ATA ATT TGT GAC GAG TGC CAC TCC ACG
Ala Gly Gly Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser Thr
115 120 125

GAT GCC ACA TCC ATC TTG GGC ATC GGC ACT GTC CTT GAC CAA GGA GAG
Asp Ala Thr Ser Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Gly Glu
130 135 140

ACT GCG GGG GCG AAA TTG GTT GTG TTC GCC ACC GCC ACC CCT CCG GGC
Thr Ala Gly Ala Lys Leu Val Val Phe Ala Thr Ala Thr Pro Pro Gly
145 150 155 160

TCC GTC ACT GTG CCC CAT CCC AAC ATT GAG GAG GTT GCT CTA TCC ACC
Ser Val Thr Val Pro His Pro Asn Ile Glu Glu Val Ala Leu Ser Thr
165 170 175

ACC GGA GAG ATC CCT TTT TAC GGC AAG GCT ATC CCC CTT GAG GTA ATC
Thr Gly Glu Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu Val Ile
180 185 190

AAG GGG GGG AGA CAT CTC ATC TTC TGT CAT TCA AAG AGG AAG TGC GAT
Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Arg Lys Cys Asp
195 200 205

GAG CTC GCC ACA AAG CTG GTC GCA ATG GGC ATC AAT GCC GTG GCC TAC
Glu Leu Ala Thr Lys Leu Val Ala Met Gly Ile Asn Ala Val Ala Tyr
210 215 220

TAC CGC GGT CTT GAC GTG TCC GTC ATC CCG ACC AGC GGT GAT GTT GTC
Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val Val
225 230 235 240

GTC GTG GCA ACC GAC GCC CTC ATG ACC GGC TAT ACC GGC GAC TTC GAC
Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp
245 250 255

TCG GTG ATA GAC TGC AAC ACG TGT GTC ACT CAG ACA GTC GAT TTC AGC
Ser Val Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe Ser
260 265 270

CTT GAC CCT ACC TTC ACC ATT GAG ACG ACC ACA CTT CCC CAG GAT GCT
Leu Asp Pro Thr Phe Thr Ile Glu Thr Thr Thr Leu Pro Gln Asp Ala
275 280 285

GTC TCC CGC ACT CAA CGA CGG
Val Ser Arg Thr Gln Arg Arg
290 295
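The CDS feature above (LOCATION: 1..885) pairs each codon with one residue, so the listing can be spot-checked mechanically. The following is a minimal sketch, assuming the standard genetic code; the names `CODON_TABLE`, `translate`, `first_row` and `last_row` are illustrative and not part of the listing. It translates the first and last rows of SEQ ID NO: 1 and confirms that 885 base pairs correspond to 295 codons.

```python
# Standard genetic code, built from the conventional TCAG ordering.
# Assumption: the listing uses the standard code (no alternative table is stated).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cdna: str) -> str:
    """Translate a cDNA string (spaces allowed) into one-letter amino acids."""
    cdna = cdna.replace(" ", "").upper()
    if len(cdna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cdna[i:i + 3]] for i in range(0, len(cdna), 3))


# First and last rows of SEQ ID NO: 1 as printed in the listing.
first_row = "ATG ACC ATG ATT ACG AAT TCC CGG GGA TCC ATC ATG AAA TCC CCG GTG"
last_row = "GTC TCC CGC ACT CAA CGA CGG"

print(translate(first_row))  # compare with residues 1-16
print(translate(last_row))   # compare with residues 289-295
print(885 // 3)              # number of codons implied by the stated length
```

The same function can be applied row by row to the remaining codon lines to compare them with the three-letter residues printed beneath them.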






(3) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met Thr Met Ile Thr Asn Ser Arg Gly Ser Ile Met Lys Ser Pro Val
1 5 10 15

Phe Thr Asp Asn Ser Ser Pro Pro Val Val Pro Gln Ser Phe Gln Val
20 25 30

Ala His Leu His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro
35 40 45

Ala Ala Tyr Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro Ser
50 55 60

Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His Gly
65 70 75 80

Ile Asp Pro Asn Ile Arg Thr Gly Val Arg Thr Ile Thr Thr Gly Ser
85 90 95

Pro Ile Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys
100 105 110

Ala Gly Gly Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser Thr
115 120 125

Asp Ala Thr Ser Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Gly Glu
130 135 140

Thr Ala Gly Ala Lys Leu Val Val Phe Ala Thr Ala Thr Pro Pro Gly
145 150 155 160

Ser Val Thr Val Pro His Pro Asn Ile Glu Glu Val Ala Leu Ser Thr
165 170 175

Thr Gly Glu Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu Val Ile
180 185 190

Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Arg Lys Cys Asp
195 200 205

Glu Leu Ala Thr Lys Leu Val Ala Met Gly Ile Asn Ala Val Ala Tyr
210 215 220

Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val Val
225 230 235 240

Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp
245 250 255

Ser Val Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe Ser
260 265 270

Leu Asp Pro Thr Phe Thr Ile Glu Thr Thr Thr Leu Pro Gln Asp Ala
275 280 285

Val Ser Arg Thr Gln Arg Arg
290 295



(4) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

AAGGGATCCA TCATGAAATC CCCGGTGTTC ACGGATAACT



(5) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

GGGAAGCCTT AATTCTTACC GTCGTTGAGT GCGGGAGAC
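The stated lengths of SEQ ID NO: 3 and SEQ ID NO: 4, and their relationship to the coding sequence of SEQ ID NO: 1, can be checked with a short script. This is a sketch of a consistency check on the printed data only; it makes no claim about how the sequences were used. The helper `reverse_complement` and the variable names are illustrative, and the two SEQ ID NO: 1 fragments are copied from the first two rows and the last row of that listing.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A/C/G/T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


# Sequences as printed in the listing, with the spacing removed.
seq3 = "AAGGGATCCATCATGAAATCCCCGGTGTTCACGGATAACT"
seq4 = "GGGAAGCCTTAATTCTTACCGTCGTTGAGTGCGGGAGAC"

# Fragments of SEQ ID NO: 1: the first 22 codons and the last 7 codons.
seq1_start = (
    "ATGACCATGATTACGAATTCCCGGGGATCCATCATGAAATCCCCGGTG"
    "TTCACGGATAACTCCTCT"
)
seq1_end = "GTCTCCCGCACTCAACGACGG"

# Declared lengths: 40 and 39 base pairs.
print(len(seq3), len(seq4))

# All of SEQ ID NO: 3 except its first three bases occurs in the 5' region of SEQ ID NO: 1.
print(seq3[3:] in seq1_start)

# The reverse complement of SEQ ID NO: 4 begins with the last 7 codons of SEQ ID NO: 1.
print(reverse_complement(seq4).startswith(seq1_end))
```

Run against the sequences as printed, all three checks are expected to hold, which suggests the two oligonucleotides flank the 3' part of the SEQ ID NO: 1 coding region on opposite strands.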






(6) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

GAGGGATCCA TCATGAAAGC GGTGGACTTT ATCCCTGTG



(7) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

GAGAAGCTTT TAACACGTGT TGCAGTCTAT CAC

(8) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

GAGGGATCCA TCATGAAACA CCTGCATGCT CCCACCGGC






(9) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

GAGAAGCTTT TAATACCAAG CACAGCCTGC GTC

(10) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 302 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Pro Val Glu Asn Leu Glu Thr Thr Met Arg Ser Pro Val Phe Thr
5 10 15

Asp Asn Ser Ser Pro Pro Val Val Pro Gln Ser Phe Gln Val Ala
20 25 30

His Leu His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro
35 40 45

Ala Ala Tyr Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro
50 55 60

Ser Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala
65 70 75

His Gly Ile Asp Pro Asn Ile Arg Thr Gly Val Arg Thr Ile Thr
80 85 90

Thr Gly Ser Pro Ile Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala
95 100 105

Asp Gly Gly Cys Ala Gly Gly Ala Tyr Asp Ile Ile Ile Cys Asp
110 115 120

Glu Cys His Ser Thr Asp Ala Thr Ser Ile Leu Gly Ile Gly Thr
125 130 135

Val Leu Asp Gln Gly Glu Thr Ala Gly Ala Lys Leu Val Val Phe
140 145 150

Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Val Pro His Pro Asn
155 160 165

Ile Glu Glu Val Ala Leu Ser Thr Thr Gly Glu Ile Pro Phe Tyr
170 175 180

Gly Lys Ala Ile Pro Leu Glu Val Ile Lys Gly Gly Arg His Leu
185 190 195

Ile Phe Cys His Ser Lys Arg Lys Cys Asp Glu Leu Ala Thr Lys
200 205 210

Leu Val Ala Met Gly Ile Asn Ala Val Ala Tyr Tyr Arg Gly Leu
215 220 225

Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val Val Val Val Ala
230 235 240

Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val
245 250 255

Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe Ser Leu
260 265 270

Asp Pro Thr Phe Thr Ile Glu Thr Thr Thr Leu Pro Gln Asp Ala
275 280 285

Val Ser Arg Thr Gln Arg Arg Gly Arg Thr Gly Arg Gly Lys Pro
290 295 300

Gly Ile
302



